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HUMAN NUCLEIC ACID FRAGMENTS, ISOLATED FROM BRAIN 
ADRENAL TISSUE, PLACENTA OR BONE NARROW 

-„., inventxcr. relate, to new nucleic acid fragments encoding gene 
oroducts or portions thereof, which fragments are obtainable fro, human 
'nucleic acid populations, individual rubers of such populations be,r.g 
present in widely varying amounts. 

Situations are increasingly arising in. which it is necessary to stuoy 
5 complex nucleic acid or polynucleotide populations. For example, it is 
now widely appreciated that an invaluable resource could be created if 
th« entire sequence of the genomes of organisms such as rr.an were 
determined and the information available. The magnitude of such a task 
should not, however, be underestimated. Thus, the human genome may 
10 contain as many as 100, COO genes (a very substantial proportion cf 
which rr.ay be expressed m the hum. an brain (Sutcliffe, Ann. Rev. 
Seurc.Ci. 11:157 C966H ]• Only a very small percentage of the stock 
of human genes has presently beer, explored, and this largely i .-. a 
piecemeal end usually - :e: - f x ca 1 1 y targeted fasr.icn. 

1 S 

There has been much public ceoate acout the nest means of approaching 
human genorr..- sequencing. Brenner has arguei (CI3A Foundation Syr-pcsi-m 
:49:t (19S0)) that efforts should re concentrated on cDNA.s produced 
from reverse transcribed rr.RNAs rather than on genomic CNA. This is 

20 cr.rr.arUy because most useful genetic information resides in the 
fraction of the genome which corresponds to rr.P.KA, and this fraction is 
a very small part of the total (51 or less). Moreover, techniques for 
cenerating cDNAs are also well known. On the other hand, ever, 
supposing near perfect recovery cf cDNAs corresponding to all expressed 

25 nHNAs, some potentially useful information will ce lost by the cDNA 
approach, including sequences responsible for control and regulation of 
genes. Nonetheless, the cDh'A approach at least substantially reduces 
the inherent inefficiencies resulting from analysis of repeated 
sequences or non-coding sequences m an approach which depends upon 

30 cencTuc DN A £ e q-i c nc i r.q . 

Recently the results of a rapid method for loe-.tifymg and 
characterising re- cDN.-.s has oeen reported ;?.dars, K.D. et al.. Science 
251, 1991. r-P 14S1-:««|. Essentially, a semi-automated sequence 
35 reader was used to produce a single read of sequence from ono end cf 
each of a numcer of cDNA.s picked at random. It was shown, by comparing 
the nucleic acid sequences of the cDNAs «cr the protein sequences 
crduced by translating the nucleic acid sequence cf the -l.'As) to each 
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other and to ►^.ov.n eequcnces in public databases, that each cf the 
cDNAs picked at random, could be unambiguously classified. The cDNAs 
could be classified as being either entirely new or as corresponding, 
to a greater cr leaner extent, to a previously known sequence. cDNAs 
5 identified in this way were further characterised and found to be 
useful in a variety of standard applications, including physical 
mapping. Unfortunately, such a process is insufficient. The longer 
the process is pursued with any given population cf cDNAs the less 
efficient it becomes and the lower the rate of identification of new 
10 clones. In essence, as the nurrJber of cDNAs which have already been 
picked rises, the probability of picking a particular cDNA more than 
once increases. This difficulty is exacerbated by the wide range of 
abundanciea at which different cDNAs can occur, which abundancies can 
vary by several orders of magr ituc*3 . Thus, whereas scrre sequences are 
IS exceedingly rare, a single cDNA type may comprise as much as 10% of the 
population of cDNAs produced from a particular tissue (Lewin, 3. Gene 
Expression, Vol. 2: Eukaryotic Chromosomes, 2nd ed., pp. 708-719. New 
York: Wiley, 1980). The need to avoid missing rarer species in any 
given population presents a considerable problem. 

20 

Various approaches ^no-called "normalisation" techniques) have been 
tried in addressing the problem of increasing the efficiency of 
examination of a mixed nucleotide population, for example, such a 
population as is to be examined in human genome sequencing. 

25 

Thus, a standard PGR protocol can be used to amplify selectively cDNAs 
which are present at extremely low levels, if there is information 
a be t the sequence cf those cDNAs. If not, a primer specific to the 
des red cDNA cannot be constructed and the desired cDNA cannot be 
30 selectively amplified. The standard PCR method is therefore inadequate 
if it is desired to characterise a number of unknown genes. 

A second approach involves hybridization of cDNA to genomic DNA. At 
saturation, Lne cDN'As recovered from gencmic/cDNA hybrids will be 

35 present in the sarnie abundance as the genes encoding them. This will 
provide a much more homogenous population than the original cDNA 
library, but does not entirely solve the problem. In order to reacn 
saturation in respect of the very rare sequences, it will be necessary 
to use huge quantities of cDNA , which need to be allowed to anneal to 

40 large amounts of genomic DNA over a considerable periods cf t im.e . 

Furthermore, cDNAs which are homologous to genes which are present in 
multiple rep:*" p in the cenom-p will bp over-represented. 
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third approach exploits the second order reassociation kinetics of 
cDNA annealing to itself. After a long period of annealing, the cDNAs 
which regain single s-.randed will have nearly the same abundance, and 
can be recovered by standard PCR (see Patanjali, S.H. et al., PNAS USA 
88, 1991, p P 1943-1947; Ko, M.S.H., NAR 19, no. 18, 1991, PP 5705- 
5711). The methods disclosed in these two publications, however, 
suffer from notable disadvantages. They are entirely dependent on the 
stringent physical separation of single stranded and double stranded 
DNA, require an elevated nurrier of manual manipulations in each 
reaction, and necessitate protracted hybridisation times (up to 288 
hours in the method of Patanjali et al.) 



Yet a further approach in "normalising" a nucleotide population is 
described in co-pending British Prtent Application No 91 15407.0, filed 
15 17th July, 1991 by KRC, and involves a PCR process in which a mixture 
comprising a heterogenous DNA population and appropriate 
oligonucleotide pri-.ers is first formed and the DNA denatured, but 
before effecting a conventional PCR protocol the conditions are altered 
to allow the denatured strands of the more common DNA species to 
reanneal together, whilst avoiding annealing of pr.mers to the DNA 
strands. By this means, rarer species can subsequently be amplified in 
preference to the more com-mon species. 



This PCR normalisation method in general comprises the steps of: 

( a) preparing a mixture comprising a heterogenous DNA population and 
oligonucleotide primers suitable for use in a PCR procejs, in which the 
DNA is denatured ; 

(b) altering the conditions to allow the denatured strands of the 
more common DNA species to reanneal, while preventing the annealing to 
-he primers to the DNA strands; 

(c) further altering the conditions of the mixture in order to allow 
35 the primers to anneal to the remaining single-stranded DNA comprising 

the rarer DNA species; and 

(d) carrying out an extension synthesis in the mixture produced in 
step (c) . 



Advantageously, the method consists of a cyclic application of the 
above four steps. 
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It will be appre-.ai.ed that the conditions may be altered by tru 
alteration of the -emperature of the reaction mixture. However, any 
conditions which affect the hybridisation of complementary DNA strands 
to cne another rray be varied to achieve the required result. 

5 

Because the rear.nea i ing efficiency of any given DNA species will depend 
on the product of its concentration and time, the r.ore abundant the 
sequence the greater the extent to which it will reanneal in any given 
time period. Once a DNA species has reached a certain threshold 

10 concentration it will no longer be amplified exponentially, as a 
significant amount will have annealed to the double stranded form 
before the priming step. Thus, as each individual DNA species is 
amplified by the process to its threshold concentration, the r?te of 
amplification of that species w;ll start to tail off. Eve-*, .ally, 

IS therefore, all DNA species will be present at the same concentration. 

The length of t:.c- itannealing step will determine hew much DNA is 

present at the t\reshold concentration. Preferably, therefore, z^.e 

duration of the reannealing step will be determined empirically for 
2 0 each DNA population . 

In the PCR normalisation process m general, the DNA primers may be 
adapted to prime s?lectively a sample of the total DNA populaticn. 3y 
using primers which will only prime a sample of the population, only 

25 that sample will be amplified ana normalised. The total quantity of 
DNA generated will thereby be reduced, which means that the cycling 
times can be kept lew. This ensures that the m.ethod is applicable to 
complex DNA populations such as cDNA populations. In addition, a first 
primer can be used which is adapted selectively to prime a sample cf 

3C the total cDNA populatic and a second primer which is a general 
primer. Advantageously, the general primer is cligo dT (each primed 
-DNA will then be replicated in its entirely, as the oligo dT primer 
will anneal to the poly-A tail at the end of the cDNA) . 

35 In co-pending British Patent Application No 92 14873.3, filed by K?,C 
13th July, 1992, a new process is cescribed which allows the study and 
identification cf the individual members o i a mixed or heterogenous 
population of nucleotide sequences perhaps of varying abundance. In 
preferred embodiments of the said procs?^, the starting nucleic acid 

40 population is tr^ited by: 

(a) subjecting the nucleic acid - o tr.e a::::r cf a reagent, 
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preferably an endcnuclease which has its cleavage and recognition sites 
separated, which reagent cleaves the nucleic acid so as to produce 
double stranded cleavage products the individual strands of which 
overlap at cleaved ends to leave a single strand extending to a known 
5 extent; 

(b) ligating the cleavage products from (a) with a population of 
adaptor molecules to generate adaptored cleavage products, each of 
which adaptor molecules has a cleavage product end recognition sequence 

10 and the population thereof encompassing a range of adaptor molecules 
having recognition sequences complementary to a predetermined subset of 
the sequences of the cleavage-generated extending single strands; and 

(c) selecting and separ ting orly those adaptored cleavage products 
15 resulting from (b) which carry an adaptor of predetermined recognition 

sequence . 



20 



25 



A preferred endonuc lease for use in step (a) cf the above process 
Fok 1. 

An important feature of th.s process is the use cf adapter molecules. 
The adaptors used must have "overhanging" fragment recognition 
sequences which reflect cr are complementary to the extending cleavage- 
derived sequences which the adaptors are designed to react with. It is 
also preferred that the adaptors used should end with a 5' hydroxyl 
group. The avoidance of a 5 ' phosphate group removes tne risk of 
inapprcpr iat- e ligation involving the adaptors. 



30 



35 



Adaptor molecules may also contain a portion permitting specitic 
sequence selection and separation (as in step (c) of the process) when 
a sequence is attached to the adaptor. For example, an adaptor can 
-arry biotin, thereby permitting advantage to be taken of the 
biot in/avidin 

reaction in selecting and separating desired adaptored molecules. 
Additionally, adaptors preferably comprise a known and selected 
sequence such that spec if ..cal ly isolated adaptcred molecules can be 
amplified by known techniques (such as ?CR) using a primer 
complementary to the cere sequence. 



40 Preferably the adaptors are short double-stranded oligonucleotides 
which can oe joined to the ends of cleavage products. They will nave 
been chemically synthesised so that their sequence can :e predetermined 
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and so that large concentrations can be easily produced. They nay also 
ce cherr.ical 1- - r:d::.ed in a way which allows then to be easily purified 
during the process. As renticned above, ideally their . ends will be 
unphesphory I a t ed so that once joined to fragments tne adaptcred end of 
5 the latter will no longer be able to participate m furtner ligation 
reactions. 

It is preferred that the adapter cleavage product end recognition 
sequences are cn the 5' end of the longest oligonucleotide strand 

10 r.aking up the preferred adaptor r.olecules, are at least 3 nucleotides 
in length and with totally random bases at the single-stranded 
position(s) two nucleotides in frorr. the 5' end. This then allows 
selection to be performed both during the joining reaction and during 
subsequent pricing reaction^. Then, because the final degree of 

15 selection is a result of the product of the degrees cf selection 
achieved at t^.ese two stages, raxirr.uT, selection can be achieved per 
adapt or/pri'-cr available. 

Adaptor strand extensions cn the 5' en a of tne longest oligonucleotide 
2 0 also facilitate the use cf r.odified oligonucleotides for separation 
purpose*" Preferably, the short oligonucleotide will be codified at 
its 5 ' end. This has the double benefit cf requiring yust one codified 
oligonucleotide for ail possible s mg le- stranded extensions that are 
used, and also placing the codification at a position where it cannot 
2 5 interfere with ligation cr subsequent pricing react: cr.s. 

Although only one type cf adapter is required per ligation reaction, it 
is preferred that acaptcrs covering all possible reactions m a chosen 
subset of sequences be present, because then tne opportunity for 

30 fragments in the chosen subset to ligate to each other is cimc.iEed. 

It is also preferred that the chosen specific adaptor, carrying a 
predetermined recognition sequence, should not only be different fror, 
the other adaptors in its s mg 1 e- stranded extension, out also different 
in the rest cf its sequence since this allows orientation to be 

35 introduced which is useful in subsequent steps. It is therefore also 
preferred that this adaptor has a codified oligonucleotide to 
facilitate its separation with tne cleaving products :: w n i c h it -cms. 

The above "acaptcrmg" process can be used to generate categories cr 
40 subsets cf sequences by rrakvng sc~e cf the adaptors specific m sore 
way, and s e 1 e c t * n g and separating ~ - in step ( c ) . In t h _ s way subsets 

c f sequences can o e provided copending u p c n the spe:.::c adapter 
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chosen, e.g. for use in subsequent nucleotide sequencing. This 
facilitates, fcr example, the identification of a large population of 
sequences by permitting a rational approach to splitting such 
populations ir.:c subsets, each of which subsets can be examined in 
turn. 

In the light of these developments, the present invention now provides 
a nucleic acid fragment encoding a gene product or portion thereof and 
con. prising any one of:- 

(a) a sequence selected from SEQ ID Nos 1 to 1193; 

(b) an allelic variation of a sequence as defined in (a); or 



15 (c) a sequence complementary to (a) -or (b). 



20 



In another aspect, the invention provides a nucleic acvd sequence as 
set out in any one of SEQ ID Nos 1 to 1193, or a complement or allelic 
variation thereof. Preferred sequences exhibit no more than 90^ 
homology to a human sequence known per se. 



In a further aspect, the invention provides a nucleic acid fragment 
comprising a portion of a sequence as defined above of sufficient size 
such that a probe of the same size and exhibiting complementarity to 

25 said portion can hybridise to said sequence. Preferably, such portions 
are at least 15 bases in length. It will be appreciated thar minor 
mismatches in whe aforesaid "complementarity" are not excluded provided 
hybridisation can still occur. In general, hybridisation conditions 
are within the choice of the skilled person, but reference can be made, 

30 for example, to the following: Melting temperature of hybrids - 
Bolton, E. T. and McCarthy, B. J. Proc. Natl. Acad. Sci, 48 p!390 
(1962). Effect cf formamide on lowering melting temperature - Casey, 
J. and Davidson, N., Nucleic Acids Res. 4, pl539 ( 1 977;. Effect of 
imperfect homology - 3onner, T. I. et al., J, Mol. Biol. 61, p!23 

35 (1973). General - Meinkoth, J. and Wahl, G . Anal. Bicchem . 13S, p267 



(1984). Oligo hycndization and washing - Lathe, R . 
PI { 1985 ) . 



Mol. Biol. 1S3, 



The present 



:r.ver.t icn 



env isage ; 



DKA 



compr i s ir.g 



4 0_ ? if j^S* 71 $l)ts or sequenc es as __r e f e r r e d_ 

r e g u 1 ato ry sequence. 



control 
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The invention includes such DNA constructs using a gene S) sten known in 
th_- art ligated t- a ^quence cr fragment of the invention so as to 
enable, upon expre?s ; cn, the provision of a fusion polypeptide. 
Preferably, an endopept idase recognition site is provided such that 
5 when the sequence cr fragment js expressed it is expressed in frame 
with a known protein with the boundary beir.g a cleavage site for an 
endor >pt idase with a rare cutting site. The known protein can then bo 
affinity purified, and the peptide corresponding to the fragment cr 
sequence in accordance with the invention may be released by tne 
10 endopept idase . Alternatively, the whole protein can be used to raise 
antibodies which can then be screened for those directed at polypeptide 
correspond ing to the fragment or sequence of the inven.ion. 

Since the present fragments arid sequences can be used to produce, inter 
15 alia, corresponding genes, whether by isolating them, by svr.thesis or 
otherw : se, such use and the resulting 0 N A fragments comprising genes 
are further aepoctr of the invention. 

'/et another aspect of the invent v^n is an expression vector comprising 
20 a fragment, sequence, gene-comprising DKA fragment, cr DNA construct, 
as above, positioned uuch that that nucleic acid sequence which 
encodes the polypeptide corresponding to said fragment, sequence or ON A 
fragment is in cperable raiding frame with a control or regulatory 
sequence . 

25 

Other aspects of the invention are host cells incorporating a seouence, 
or fragment, cr gene -compri sing DKA fragment, or UNA construct, as 
above, as a heterologous part of the expressible genetic information of 
the cell. The production of such modified host ceils can be achieved 
30 using methods known in the art. Such modified host cells can be used 
to express corresponding proteins, and these materials lend themselves 
in turn to the preparation of corresponding monoclonal cr polyclonal 
antibodies using standard techniques. 

35 Also included in this invention are such antibodies. Reference can be 
made, inter alia, to the following literature: Monoclonal antibodies, 
Cambell, A. M. laboratory Techniques in 3 iccnem is t ry , and Molecular 
Biology Ed. Burden, R. H. and van Knippenberg, ?. H. vol 13. Elsevier 
Amsterdam 1954. Godmg, J. W., Monoclonal antibodies: Principles and 

40 Practice, 2nd Edition, academic Press, London 1966. Kipps, T . J. and 
Kerzenberg, L. A. , Handbook of Experimental Immunology : Atolications 
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D. M. et al., plC8 Blac^well scientific Publications, Oxford. Harlow, 
E and Lane, D. Antibodies, A Laboratory Manual, Cold spring harbor 
Laboratory, Cold Spring, New York. 

5 Expression in an appropriate higher eucaryotic host may be important to 
ensure correct protein folding and also activity. Expression to avoid 
copurif icat ion of toxic products can sometimes be better performed in 
organisms approved for human consumption, eg prokaryotic Bacillus 
subtilis, eurkaryctic yeast, mammalian cows milk vectors, and other 
10 methods known in the art. 

The invention also includes novel gene products or portions thereof 
encoded by a fragment, sequence or cene-compr is ing DNA fragment of the 
i nvent ion . 

15 

It will be appreciated that the sequences of the present invention 
collectively have utility based, inter alia, upon their common origin, 
and hence they can effectively be considered together rather than as 
separate entities. It is convenient to represent them as separate 

20 sequences, because this is how they were produced and serves as 
"punctuation" betweer. the different functional entities which each 
sequence represents. However, the sequences could just as easily have 
been presented as a continuous sequence derived by placing them end to 
end in the order in which they were produced, with a separate 

25 indication of where the beginnings and ends of the component sequences 
are . 

In contrast to investigations hitherto, where gt;ne fragments (sequence 
fragments) could only be identified through some known characteristic 

30 [for example: their homology to a fragment v^ich largely encodes amino 
acids identified by sequencing a previously isolated peptide or is the 
antisense of that coding sequence; or them having at least partial 
homology to previously characterised nucleic acids; or them having 
ability to encode expressed proteins which could later be detected by 

35 functional assays of the cells expressing those proteins or by using 
antibodies which had been previously raided against the proteins to 
detect their expression, Sambrook J., et al., Molecular Cloning CSH 
Press 1989], the sequences and fragments described by the present 
invention are entirely underivable and unpredictable from the prior 

40 art, but are nonetheless clearly cf great value for various purposes. 

Thus, such sequences, by comparing them to sequence databases, can be 
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t h e existence of 



10 



15 



2 0 



25 



existing gene : 

genes were known, and new genes 
(Karli.n, S. and Altschui, S. 7. Pre:. 
( 198C) ) . 
genes an: 



new meters of 



used as a r^ans for determmm: 

e -' r,ew h -"- an genes when previously only non-human 
new qer.es when previously no genes were known 
Natl. Acad. Sci . 67 p226n-226S 
n al^ cases, this allows tne isolation of the corresponding 
their products, and hence enables the manufacture of 
rolecules of potential biological interest by recombinant means. 
Screening libraries cf known materials or hitherto unexplored source 
materials for biological efficacy is now an important industrial 
activity in the search for new therapies and therapeutics. w.ien new 
sequences have already been found to have counterparts m gene families 
or in non-human genes then knowledge about biological efficacy may 
already be appsrant. For example, new re-ptors or receptor 
agonists/antagonists may exhibit differences t. known instances of 
theoe molecules, and such differences could make them more suitable as 
therapeutics by, for example, exhibiting binding characteristics which 
are more m .ee-.ng with avoidnnce cf toxicity. Reference can oe made, 
for example, to polymorphic dopamine receptors and the implications for 
m.entai health (Iversen, Nature 353, p!C9 (1992), and Van ?ol, H. K 
H. et al., Nature 35£, F Ih9-152 (1992)). Where absolutely required, 
realisation cf ful l length cDNAs for expression can te achinv H !r.> 



u s i n o 



sequences 



5Creen L^l hybridisation.) suitable m- ;; 



libraries containing full ienqth clones ( D ' Aless ion , J. y . , 8 t al., 
Focus (Gi^co B.R.L, 9 ol (19S7)). Alternatively, the sequences can be 
used to design primers suitable for obtaining the missing sequences by 
PC? or other amplification methods (Frcnman, y. . A. ; Dush, H. K . and 
Marr.m, G. P . , ?rcc. Nat: . Acad . s _ , . £5 pS g 58 - 9002 , 292 3)). 



30 



Appropriate use cf the sequence fragments in antisense or triple helix 
(Griffin et ai., Science 245 ?967-971 (1989)) applications' will be 
useful for identifying manipulable targets related to disease. For 
example, viruses have been inhibited by antisense RNA to their mP.NAs 
(Chang, L-j., and Stoitfuc, c. y. j. Virci. ? 921-97< (1937)). A 
similar effect cculd te achieved by targeting the expression of 
cellular proteins wnicn are essential for growth or maintenance of the 
virus. 



Partial 



:n c Z NA s 



a v e crea: 



u 1 1 . 1 1 v once expressed . 



manner of expression can be selected by one skilie 
40 the intended application. Expression cf full lenoth tUNAs > 
required for b.ological activitv 
e u c a r v c 1 1 c 



: s t s may c c selectee as the 
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e art to suit 
s typically 
an z lower or higher 
express::-, and hicher 
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eucaryc:es r. = y s i preferred to ensure correct n:d:f icaticns, for 
example, g 1 y cc s y 1 a t i or i.n vivo, '..'her. this proves :c oe im.pcrtar.t. 
Expression c^r. re err, jred ry situating tr.e cDNA appropriately to 
signals f c r expression (r.-ar.r, ( E . a r. d 3 r o s i u s , J . Gene 4 0 p J £ 3 195 5), 
5 Shim.uzu, Y et al., Gene 65, pi 41 f 1 9 8 3 ) , Straus, 3. anj Gi Inert, W. 
Prcc. Natl. Acad. Sci. 32, p2C14 (1925)). Sucn signals nay include a 
promoter for transcription, which ray itself be regulatable. 

Tiie proteins thus-expressed can be screened for activities of 
10 therapeutic or commercial value. It may be that the proteins have to 
be first isolate: for this purpose or can be assayed in. situ. It may 
be desirable that so.me means of stabilising the expressed protein is 
employed. This can be achieved, for example, {and as indicated 
earlier) by expressing i n frame a , part of a fusion po Ivrept id'e, (Smith, 
15 D. 3., et al., Prcc. Natl. Acad. Sci. 52 p=C73 (19S6)). 

Useful in ti be dies m:i Oe raided against tr.e expressed oroteir. s. Zt is 
coTmonly net an absolute recu : r ener.t tnat full length p:c:e;r.!: are 
produced, altnough this may influence the quality of the ar.tioc.dies 

20 produced. Peptioes as sncrt as 3 or 9 amine- acid s length con re- 

used as art-gens /Germ- in ?. . , N. Nature ^35 pp605-£07 (199 1), 7{^zer.7>k-/ . 
A., Y., et a!., Nature 553 p522-627 (1991)). Immunogenic peptioes 
could simply be rynthesised using the ammc-acid sequence translate d 
from a sequence or fragment of this invention. It is desirable, 

2 5 although not absolutely required, that some m.eans of producing purified 
antibodies i s adepted. Wne n fusion poly peptides are used to raise 
antibodies, an affinity matrix specific for the generic part cf the 
protein allows the fusion polypeptide to be imm.oo 1 1 i sec (Srmth , 5. E . , 
et al., Prcc. Natl. Acao. Sci. 53 p5073 (1956)). Tne immooilised 

30 polypeptide can then be used to affinity purify t\ --nmbcbies. 

Antibodies to both the generic part of tne fusion polypeptide and the 
part cf interest are produced. When these need oo oe discriminates 
between, a different affinity column can he used to remove only tnose 
a-tibodies specific fcr tne generic part o; tne polypeptide. 

5 5 Alternatively, a:.: a s - e n 1 1 o n e d earlier, it can re arranged tnat tne 
boundary between the t~o separate protern components of the fusion 
polypeptide has ::.» rec com tier, sequence fcr an e no c pert idase witr. a 
rare cutting site. Tr.e pepmce cf interest oar. then oe releasee from 
the affinity pur.fied polypeptide cy tne action cf tne enoopept ic ase 

4 0 (Nagai, K., and Too terser., H . , C. Methods 5nt~cl. 153 p 451-431 (1987). 

A r. o tr.e: alternative is r a . £ e mono clonal ant.Ltj.es again so one o j r _ f i e d 
protein. 
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_a:r,s: 
beneficial wr.er. 
se' se condition, c c 
and used to target 



T r. e ar.t::;j;cs can t e used fcr .oca* isir.g i n situ, 
Sa"?les t k .r:L:r, , fcr e>:a~ple, I ". I S A or HI A assays 
which tr.ey were raised. These uses are particular 
the results c : the assays car. ce correlated tc a d i s 
cancer. . r cr example tumour r.urkers rr.ay be four. 

therapeutic agents. The antibodies can also t e used tc detect cr 
monitor .markers cf undifferentiated growth, infect en, cardiovascular 
or irr-mur.e disease cr a therapeutic response. When the antibodies 
recognise cell surface proteins they can be used in isolation cr m 
combination to isolate particular populations cf cells. These in turn 
can be used to isolate yet more cDNAs which will be enriched for yet 
nore of such surface markers for the population, which, if similarly 
screened, will permit yet further subdivision cf tr.e population. 
Ultimately, panels of antibodies which can describe particular disease 



states will arc 
applications - s 



Such antibodies could be t a : 1 o red f ; 
as ciagnostic purposes and disease r. o : 



: o r e n s i : 



20 



25 



sic- a-.: 



*ne sequence- cr fragments can also L e used for ccr.e::: ar 
mapping, fcr exa.rpie, to diagnose the likelihood chat 
individual is predisposed towards a given genetic disease. In tn 
event *f a segue. _e cc-lccating, genetically, with a disease ccr.e, i 
can be used fcr tne derivation cf new disease theraoies bases uoo 



precise genetic knowledge. Such therapies can 



riuce , 



the t e c h n i g u 
p455-460 (1992)} 



if so-called 



therapy" (Dusty filler, A . 



example, 
mature 3 5? 



35 



40 



Antibodies can ce produced against the protein cf a genetic disease 
with sufficient discriminating power to discriminate ce-twoen diseased 
and non-diseased states (Caskey, 7 . Genome Sequencing Conference, 
Hilton Head, S. Carolina (1591)). This would be useful fcr reducing 
the dependence cf such ^ est. s on nucleic acid-based screens. Such 
antibodies also have the advantace of 
ex cr e s s ir - ~ ' ~ k 



i 1 1 c w i n g cetect.cn c t faulty 
:f the protein, fcr example levels of expression which may 
be important fcr development of the disease in sic. 



onset conditions. 



Also very important is that not all cDNAs are likelv 
conventional means, whereas the present 
"comprehensive". The use of the 



sequences are, 



:iass 



necessity to truncated clones increases the chances tr.at 
will be clc-ec free of any sequencer tnat could ctherWiE. 
from being cloned. Sequence obtained can then ce used * 
primers :rcm ~ i c n tr.e rp~aird-;_ can he cttamed w * *: 
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clone - 



This invention will now be further described and illustrated by means 
of the following Examples. 



on 



All oligonucleotides used in these Examples were synthesised Trityl 
using an AB I 3603 DNA Synthesizer accordin- to the manufacturers 
instructions. Purification was by reverse phase HPLC (see, for 
example, Becker, C, R - , et al., J. Chromatography 326, p293-299 
10 (1985)). 



Example 1 



Human brain and adrenal tissues ware obtained from a mixture of 12 to 

15 15 week menstrual age foetuses and then snap frozen in liquid nitrogen 
before storing in bijou bottles in a -80"C freezer. The two types of 
tissue were used separately, directly from the freezer, to prepare cDNA 
from which restriction fragments were generated for sorting into 
subsets. lg portions of each of the separate tissues were homogenised, 

20 using an Ultra-Turrax T25 Disperser (Janke and Kunkei, IKA- 
Labortechni> ) , on ice ir. the presence of 4M guanidinium isothiocyanate 
to solubilise macromolecules . RNA was isola.ed from each homogenate by 
using cent ri fugat ion to sediment it through caesium tr i f luoroacetate . 
This was performed usinq the Pharmacia kit according to the 

25 manufacturer's instructions, except that centr i f ugat ion was performed 
for 36 hours and the RNA obtained was finally desalted and concentrated 
by performing two ethanol precipitations in succession with two 70% 
ethanol washes after each precipitation. In each case, polyA" (mRNA) 
was isolated from 200 to 4C0 fig of the total RNA by binding it to 

30 magnetic oligo-dT coated beads (Dynal). Solution containing unbound 
material was removed from the beads, which were washed, and then mRNA 
eluted directly for use. mRNA isolation was performed in accordance 
with the manufacturer's instructions. Yields of RNA from the beads 
were between 1 and 3% of the total RNA. 2 to 4 ^g of the eluted RNA 

35 were used for cDNA synthesis. cDNA synthesis was performed according 
to the method of Gubler, U and Hoffman, (B. J. Gene 25 p263 (1983) 
using a Pharmacia kit according to the manufacturer's instructions. 
OligodT was used to prime the first strand cDNA synthesis reaction. 
The cDNA was purified by extracting twice with phenol /chloroform and 

40 then low molecular weight solutes including nucleic acids below ca. 300 
bases were removed by passing the cDNA reaction mixture through a 
Pharmacia S400 spun column used according to the manufacturer's 



WO 94/01548 



PCT/GB93/01467 



instructions. Running buffer fcr the column comprised 10 n\M TrisHCl, 
1 mM EDTA, 5 0 mM NaCl Q pH 7.5. 

The colunn eluate was adjusted to 10 n\M Mg : * and then the purified cDNA 
5 was restricted by the action of 1 unit per 10 pi of the endonuciease 
Fok I at 37 *C for 1 hour, so that it would able to accept adaptors 
to enable fragment sorting. 

The cDNA fragments were purified by two successive phenol/chlorof om 
10 extractions followed by passing them through S400 spun columns as 
described above. 

The adaptors used were oligonucleotides 5' N,N 4 N,N<TCCTTCTCOTGCGACAGACA 
(SEQ ID: 1194) with the complementary strand 5' TGTCTGTCGCAGGAGAAGGA 

15 (SEQ ID: 119S) and 5' AAN^TCTCGGACAGTCCTCCGAGAAC (SEQ ID: 1196) or S* 
TTN,N,TCTCGCACAGTGCTCCGAGAAC {SZQ ID: il97) each with the complementary 
5' biotinylated strand GTTCTCGGAGCACTGTCCGACA (SEQ ID: 1198). These 
were added to 25% of the eluted material by incubating together 200 
pmoles of the mixture of dcublc-stranded adaptors in the elution buffer 

20 to which had been added MgCl : to lOmM, ATP to lOmM and 0.025 units/pl 
of T4 DNA ligase. The oligonucleotide 5' biotinylated 

GTTCTCGGAGCACTGTCCGACA , ( SEQ ID: 1198) and whichever of the 
complementary oligonucleotides with which it was used, each comprised 
1/32 of the molar propcrtion total adaptors. The final rescticn volume 

25 was 90 ul which was heated to 65°C for 3 minutes and then cooled to 
room temperature before the ligase was added. Ligation was performed 
for 16 hours at 12°C. 

Two successive phenol/chloroform extractions were performed to remove 
30 the ligase. The final aqueous phase was passed through an S400 spun 
column (Pharmacia) as described above except that the column was used 
with 10 mM Tris pH 8.3/50 mM NaCl. 

The column eluate was adjusted to 25m>S Kg2 + , 0.5mM dNTPs in a final 
35 volume of 2C0 yl. The mixture was placed in a thermccyc ler (Techne 
MV?2 ) and hected to 7 8 ° C for S minutes. At this point 10 units of 
cloned Tag DNA polymerase (AmpliTaq, Perkin Elmer) were added. This 
was followed by an incubation at 72°C fcr 10 minutes to fill in the 
unligated strand of the adaptor. After the second incubation 200 (jl of 
40 streptavidm coated magnetic beads (Dynal) prepared according to the 
manufacturers instructions were added to bind cDNA ligated to that of 
the oligonucleotides which was complementary to the 5' 
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. GTTCTCGGAGCACTG7CCGAGA { SEC ID: 1198) biotinylated adaptor. Bead 
binding was allowed to p-^ceed at 28°C for 30 minutes with mixing every 
10 minutes. 

5 Un-biot iny lated cDNAs were washed from the beads with 400pl each of 2M 
NaCl twice, fresh 0.15 mM NaOH four times at 28 C C for 5 minutes each, 
water twice and finally a buffer comprising 20 mM Tris pH 8.3, 50 mM 
NaCl, and 25mH Mg : " . The beads were then resuspended in 240 pi cf the 
final buffer including additionally 0.5 mM dNTPs and divided into 4x60 
10 pi. 

Four of the 60 pi aliguots, two from each tissue, were processed 
further specifically to prime and copy a subset of the immobilised, 
adaptored fragments. 2 pmoles of the primer 5' CTr-TCTGTCGCAGGAGAAGGAA 

15 (SEQ ID: 1201) were added to each of two aliquots, one from each 
tissue. 2 pmoles of the primer S' CTGTCTGTCGCAGGAGAAGGAG (SEQ ID: 
1202) were ad^eJ to each of the other two aliquots. 2.5 units of Taq 
DNA polymerase were added to each reaction and 16 cycles of alternate 
denaturation at 95°C for 30 seconds, annealing at 63°C for 2 minutes 

20 and polymerisation at "2°C for 3 minutes was performed to accumulate 
the selecred s ingle -strands in solution. 

On completion of the DNA synthesis reactions a further 30 pi of 
resuspended beads were added to each reaction to remove the 

25 biotinylated fragments. The reaction was incubated at 28°C for 30 
minutes mixing every 10 minutes to ensure that the biotinylated strands 
were bead bound. Each aqueous phase containing the newly synthesised 
strands was then removed and extracted with pher;cl/chlorof orm twice to 
remove the enzyme before being further purified by passing through an 

30 S400 spun column equilibrated with 10 mM Tris pH 8.3/50 mM NaCl as 
described above. 

Rounds of PCR amplification of subsets of the selected fragments were 
performed by using the original primer in each case, together with one 

35 of the primers 5* GTTCTCGGAGCACTGTCCGAGAG ( SEQ ID: 1199) or 5' 
GTTCTCGGAGCACTGTCCGAGAC SEQ ID: 1200). This simultaneously rendered 
the fragments double-stranded and increased the amounts of available 
material. It was not known how many cycles of amplification would be 
required at this stage, since each primer pair would be expected to 

40 behave differently. It was therefore necessary directly to determine 
a suitable number empirically by using standard agarose gel 
electrophoresis to examine the reaction products after a given number 
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of cycles. In some case 1 to avoid the accumulation of non-specific 
products, it was neces / to perform an initial 5 cycles of 
amplification with both o_ the primers present at 2 pmoles each. All 
reactions were performed using 8 pi or 12.5 % whichever was the larger 
5 but not exceeding 12 pi of ti.e column effluent abo/e. Reaction 
conditions were adjusted to 20 n\M Tris pH 8.3, 50 mM NaCl, 25m_M Mg : *, 
0.5mM dNTPs and 2.5 units of Taq DNA polymerase in a final volume of 40 
pi. Apart from when an initial amplification with 2 pmoles of each 
primer was performed, 20 pmoles of each primer were used. Cycles of 
10 amplification were performed at 95°C for 30 seconds, 65°C for 1 minute 
and 72°C for 3 minutes. 

For the purposes of cloning, selected cDNA was amplified as described 
immediately above, except that the reaction was not monitored. 
15 Instead, the number of cycles which had previously been shown to just 
give rise to all observable products plus another 4 cycles were 
performed. In addition, an extra 72°C for 10 minutes incubation was 
performed after * he last cycle. 

20 The products of the reaction were then prepared for directional 
clcni--j. Water was added to adjust the final reaction volume to 60 pi. 
Enzyme w?: removed by two successive phenol /ch loro form extractions. 
The finnl aqueous mixture was passed througn an S400 column as 
described above, except that it had been equilibrated with 10 mM Tris 

2 5 HC1 pH */.5, SCmM NaCl . 

For directional cloning, advantage was taken of tne different known 
sequences Lr.troauced at each end of the selected cDNAs by the adaptors 
in a modification of the method of Aslandis, C. and de Jong, P. J. 

30 (Nucl. Acids Pes. 16, p6156 (1990)). Different cohesive ends were 
produced cn each end by using the excnuclease activity of T4 DNA 
polymerase to resect from the 3* end, to the first T in each case. To 
75 pi or 75 \ of the column eluate, whichever was least, were added 
9.5 pi of 100m.". TrisHCl pH7.4, 100 mM MgC12, and 9.5 pi of 0.5 mM dTT? . 

35 16 units of Ts DNA polymerase were added and the reaction incubated in 
a water bath at 37°C for 30 minutes. The enzyme was removed by 
extracting with phenol/chloroform, twice successively. The salt cf the 
final aqueous phase was adjusted by passing it through an S300 column 
(Pharmacia) equilibrated with 10 r^i TrisHCl pH 7.4, 1 mM EDTA as 

40 described above. 

The E . co I i plasm id cloning vector pBluescript KS* {Altir.g-Meese, M . A. 
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and Short J. M . , Nucl Acids Res. 17 p9494) was prepared for accepting 
the resected cDNA by restriction cleavage at the BamHI and Hindlll 
sites and then adaptoring the resultant cohesive ends using the 
specific adaptors produced by the oligonucleotide 5* AGCTCGGCTCGAGTCTG 
5 (SEQ ID: 1203) with its partially complementary oligonucleotide 5' 
GCGACAGACAGCAGACTCCACCCG (SEQ ID: 1204) and the oligonucleotide 5* 
GATCCGGCTCGAGT (SEQ ID: 1205) with its partially complementary 
oligonucleotide 5* CCGAGAACACTCGAGCCG (SEQ ID: 1206). Preparation of 
the vector and adaptoring were performed according to standard 

10 procedures. Insertion of the cDNA was performed between the BamHI and 
Hindlll restriction sites. Recombinant vectors were transformed into 
the host XLl-Blue (Bullock, W. O. et al Biotechniques 5 p376-378 
( 1987)) by the method of Hannahan, D. J. (Mol. Biol. J 66 p577-580 
(1983)). Suitable standard controls for the ligations and 

15 transformations were also included. 

Post transformation procedures were as described in "Molecular 
Cloning", 2nd Edition (Sambrook J., Fritsch, E. F . , anu Maniatis, T. 
CSH Press (1989)). Colonies were produced by plating onto X-gal/IPTG 

20 L-agar plates containing 50pg/ml d..picillin and 10pg/ml tetracyclin. 
Clear colonies were picked, each into a separate well cf a microtitre 
plate, containing lOOpl of L-broth and 50pg/ml ampicillin. Growth was 
allowed to occur for 16 hours at 37°C. 100^1 of 50% cr 30% glycerol 
was added to plates which were archived at -20°C or -80 C C, 

25 respectively. 

Bacteria corresponding to those archived were used for preparing 
templates for sequencing by the dideoxy method (Sanger, F . Kilklen, S. 
and Coulson, A. R. Proc. Natl. Acad. Sci. 74 p5463-5467 (1377)). 

30 Bacteria for this purpose were either grown on L-agar places containing 
50pg/ml of ampicillin, prepared at the scime time as they had been grown 
in liquid culture, or after plating out from the archive. 
Alternatively, fresh liquid cultures were inoculated from the archive. 
In all cases, cDNA inserts were amplified fcr sequencing by PCR 

35 (Saiki, R. K . et al Science 239 p487-491 ( 1938)). PCR was either 
performed using bacteria directly added to the reaction, by a 
toothpick, or PCR was performed using l/50th of the plasmid isolated by 
preparative methods (Holmes, D. S. and Quigley, M. Anal. Biochem. 114 
pl93 (1981)) from the bacteria in the liquid cultures or from the 

40 plates. 



20 pmoles of each of the PCR primers 5' biot iny lated GTAAAACGACGGCCAGT 
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IS 

(SEQ ID: lie?) and 5' CGA 3TCCACGGTATCG (SEQ ID: 1203) were used :r 
40^1 react; ens containing 1 . Srr.K Mg : \ 50 rvM KC1, Tris-KCl pH S.3 and 
0.25 units cf Amplitaq (Cetus) . Reactions were performed at 95°C, fcr 
1 minute, followed by 35 cycles at 9 5 ° C fcr 30 seconds, 60 ° C fcr 30 
seconds and 72°C for 40 seconds. After tne cycles, a firal incubation 
at 72 C C for 5 minutes was performed. 

After PCS, standard agarose gel electrophoresis was used to determine 
which reactions had been successful. The biotinylated strands cf 
successful reactions were then recovered for single-stranded sequencing 
by binding them to cteptravidin coated beads (Dynal) and then washing, 
all according to the manufacturers instructs is , except that the 
washing steps were either performed manually or p. r formed automatically 
in the 96 well microtitrc plate format using a Biomek robotic work- 
15 station attached to a side-arm loader (3eckm.an). 

Didtoxy cha..-, termination sequencing reactions were performed using the 
immobilise-, biotinylated strands as templates and 2 pm.oles cf the 
oligonucleotide 5' CC AGGTCGACGCTATCG ( SEQ ID: 1209) as primer. 

2C Heacticns were perrormed using fluorescent ly-label led terminators (Du 
Pont* or a f lu oroecein-labelled primer (Pharm.acia) according to the 
manufacturers instructions. Reactions were analysed using automated 
DNA sequencers. A Genesis 2000 was used fcr the "Du Font" reactions 
and an A.L.F. for the "Pharmacia" reactions. Bases were assigned for 

25 tne Genesis 20C0 reads using the manufacturers Rase Caller software. 

Files cf called bases were then transferred co a GUN Network from an 
Apple Macir.-. ^-i computer which had been used for oase calling. Raw 
data from the A.L.F. reads was directly transferred to a SUN networK 
where bases were called using the public domain "trace editor software" 

30 (TED). In both cases, files of called bases were entered into a 
Sybase™ database. Entering data entailed automatically removing vector 
and adaptor or linker sequences, but net editing ambiguous bases. 
After removal of the unwanted bases, files were automatically compared 
co other sequences in the cDNA database and the latest versions cf the 

35 publically available databases, GEN BANK and SWISSPROT. Searches '/ere 
performed with the "basic local alignment search tool" (BLAST) (Karl in, 
S. and Altschul, S. F. Proc. Natl. Acad. Sci. £7 p2264-226S (1990)). 

Sequences SEQ ZD Ncs 1 to 610, given hereinafter, were obtained by the 
40 above procedure. 
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A second method of preparing cDNA libraries for cbt;tin*:_ng gene 
fragments oJ the invention took advantage of the PCR normalisation 
process described aocve. Standard procedures were used „o prepare mRNA 
from RNA that had been isolated by standard caesium chloride bou/ant 
5 density gradient methods from a full term human placenta. The 
oligonucleotide LNotdt, sequence 5* TACGTTCGACAAGCTTGAATTCC^GGCCCC ( T) ^, 
(SEQ ID: 1210) was used at 1 /jM with AHV reverse transcriptase, to 
prime first strand cDNA synthesis under standard conditions from 0.5 ^g 
of the placental mRNA. Temperatures above 65 *C were ured to inactivate 
10 the reverse transcriptase and then the volume of the reaction made up 
to 100 with water. 

PCRs were then performed in reactions containing 1 / 1 of the diluted 
cDNA, 10 mM Tris-HCl /..; 8.3, 40 mM KCl, 1.5 mM KgCl : , 0.01% gelatin, 200 

15 jjM dNTPs / 10 uCi a J: P dCTP, 1 uH each of the primers 11AD1, sequence 5' 
CCC(TA) (GC)CGCCGA ( SEQ ID: 1211), and LNotdT and Taq DMA polymerase. 
An initial dentuuration period of 95 *c for 90 seconds was followed 
either by 35 cycles of standard PCR, comprising 95'C for 30 seconds, 
45'C for 30 seconds and 72 'C for 30 seconds or alternatively 3 cycles 

20 of the standard PCR already described followed by 27 cycles of Cot TCR 
during w^ich an additional step of 72 "C for 16 minutes was placed 
between all of the 95*C and 45*C steps of the standard PCR. The 
standard PCR was followed by a single 72 *C for 3 minutes step while the 
Cot PCR was followed by one standard PCR cycle except that the 72 "C 

25 incubation was performed for 3 minutes. 

Products of the PCR reaction were end repaired by adding 5 units of T4 
DNA polymerase to the reaction and then incurating at 37 "C for 10 
minutes. Enzymes were removed by phenol extraction. The cDNA was 

30 precipitated by 70% ethanol, dried and then resuspended in NotI buffer. 
20 units of NotI were used to digest the cDNA under standard 
conditions. cDNA was again phenol extracted and ethanol precipitated. * 
10% of the purified NotI cut DNA were ligated to the vector pBluescript 
lt ; ng-Meese, M . A. and Short J. K. Nucl. Acid Res. 17 p9494 which had 

35 been prepared as standard to receive this DNA by restricting with the 
enzymes NotI and EcoRV. Transformation and processing of clear 
colonies was performed as described above except that the host E . col i 
strain DH5a was used in place of XL-1 Blue. 



40 



Preparation of clones for sequencing, sequencing and sequence analysis 
of cDNAs in clones thus-produced were performed as described in Example 
1. 
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Sequences SZZ ZO Ncs 611 to 7 72, given hereinafter, were obtained by 
the above procedure. 

Example 3 

5 

cDNA libraries corresponding to adult brain cortex (Clontech 
Laboratories, Inc., Cat No. HL10036) and adult bone narro.* (Clont-ech 
Laboratories, Inc., Cat No. HL105S6) prepared in lambda gtll phage were 
transfected into £ . col i Y1090 and plated out for colour selection of 
10 recombinant plaques ("Molecular Cloning", 2nd Edition Scimbrook J., 
Fritsch, E. F . , and Maniatis, T. CSH Press (1989)). 192 lambda Zap 
clones, corresponding to rhabdomyocarcoma cDNAs and a gift from C. 
Cooper, ICR, Sutton, were similarly plated except that the host XL-1 
Blue was used. 

15 

Clear plaques from each library were resuspended in 5 of Tris-HCl pH 
8, 1 mM ETTA . 2 pi cf the r .Itant phage suspensions were added 
directly to PCRs for the purpos, of amplifying the cDNA inserts for 
sequencing. PCR was performed as described in Example 1, except that 
20 v.he oligonucleotides used as primers for the lambda gtll clones - ere 5' 
GGTGPOJACGACTCCT^GACCCCG ( SEQ ID: 1212) and 5* TTGACACCAG ACCAACTCCTAATG 
(SEQ ID: 1213). Whicheve- of the oligonucleotides was to be used to 
prime the strand which would serve as the sequencing template was used 
in biotinylated form. 



25 



30 



Preparation of clones for sequencing, sequencing and sequence analysis 
of cDNAs in clowes thus-produced was performed as described in Example 
1, except that 2 pc\oles cl the primers that ~ere unbiot iny 1 ated in the 
PCR were us«d as sequencing primers. 

Sequences SEQ ID Nos 773 to 1193, given hereinafter, were obtained by 
the above procedure. 



35 



The following are the SEQUENCE LISTINGS which comprise sequences SEQ ID 
Nos 1 to 12 13 referred to hereinbefore. Certain of these sequences are 
preferred, and are listed as such after tne main SEQUENCE LISTINGS. 
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CLAIMS: 

A nucleic acid fragment encoding a gene product or portion 
thereof and comprising any one of:- 

5 

(a) a sequence selected from SEQ ID Nos 1 to 1193; 

(bj an allelic variation of a sequence as defined in (a); or 

10 (c) a sequence complementary to (a) or (b) . 

2. A nucleic acid sequence as set out in any one of SEQ ID Nos 1 to 
1193, or a complement or allelic variation thereof. 

* 3 3 - A sequence as claimed in claim 2 and which exhibits no more than 
90% homology to a human sequence known per ee. 

4. A nucleic acid fragment comprising a portion of a sequence as 
defined in claim 2 or claim 3 of sufficient size such that a probe of 

20 the same size and exhibiting complementarity to said portion can 
hybridize to said sequence as defined in claim 2 or claim 3. 

5. A fragment as claimed in claim 4, wherein said portion is at 
least IS bases in length. 

25 

6. A fragment p s claimed in any one of claims 1 , 4 or 5 and encoding 
at least a portion of a biologically active polypeptide. 

7. A nucleic acid sequence as claimed in claim 2 or claim 3 and 
30 encoding at least a portion of a biologically active polypeptide. 

8. A DNA construct comprising a fragment as defined in any one of 
claims 1, 4, 5 or 6 or a sequence as defined in any one of claims 2, 3 
or 7, together with a control or regulatory sequence. 
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9. A construct as claimed in claim 8 which encodes a fusion protein 
comprising a known protein and the polypeptide encoded by said fragment 
or sequence. 

40 10- A construct as claimed in claim 9, wherein the fusion protein 
encoded is a cleavable fusion protein having an endopept ida se 
recognition site positioned between codons corresponding to said known 
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protein and said fragment or sequence. 

11. The use of a fragment as defined in any one of c' 5 or 
6 or a sequence as defined in any one of claims 2, 3 c produce a 

5 gene. 

12. A DNA fragment comprising a gene obtainable by the use defined in 
claim 11. 
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13. An expression vector comprising a fragment as defined in any one 
of claims 1, 3, 5 or 6, a sequence as defined in any one of claims 2, 
3 or 7, a DNA construct as defined in any one of claims 8 to 10, or a 
DNA fragment as claimed in claim 12, positioned such that that nucleic 
acid sequence which encodes the polypeptide corresponding to said 
fragment, sequence or DNA fragment is in operable reading frame with a 
control or regulatory sequence. 

14. A vector as claimec i claim 13, wherein said vector control or 
regulatory sequence comprises a regulatable promoter. 

15. Host cells which incorporate as a heterologous part of their 
expressible genetic information a fragment as defined in any one of 
claims 1, 3, 5 or 6, a sequence as defined in any one of claims 2, 3 or 
7, or a DNA fragment as defined in claim 12. 

16. A process for the product ion of a polypeptide comprising 
cultivating host cells as defined in claim 15. 

17. An antibody directed against a polypeptide obtainable by the 
performance of a process as defined in claim 16. 
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An antibody as cl5iT«*d in claim *7 and which is monoclonal. 



19. A novel gene product, or portion thereof encoded by a fragment as 
defined in any one of c loir- 3 1, 3, 5 or 6, or encoded by a sequence as 
defined in any one of claims 2, 3 or 7, or encoded by the gene 
comprised in a DNA fragment as defined in claim 12. 
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